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As coronavirus disease 2019 (COVID-19) cases arose globally, active case 
finding by performing throat swab test proposed high risk for the healthcare 
workers. Saliva had recently been reported to show positive detection means 
for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and 
proposed advantages of self-collection, less requirement of transport media, 
and reduced nosocomial transmission risk. However, support evidence 
regarding its diagnostic value was still lacking and varied widely in 
specimen collection method. This systematic review aimed to assess the 
diagnostic value of salivary specimens (SS) for COVID-19 detection using 
reverse transcription polymerase chain reaction (RT-PCR) assay compared 
with throat swab specimens (TSS), while putting into consideration 
confounders such as patients’ initial condition, specimen collection method, 
and transport media used. Six databases were used for identifying relevant 
studies. Final search yielded 19 eligible studies which was reviewed based 
on the major outcome: diagnostic agreement, sensitivity & specificity, and 
viral load comparison. The use of SS as an alternative to TSS showed to be 
promising although specimen collection method needed to be standardized. 
SS was comparable to TSS in detecting COVID-19 using RT-PCR assay, 
especially in symptomatic or confirmed cases. More Randomized controlled 
trials (RCTs) were still needed to clearly demonstrate the ability of SS to 
capture asymptomatic cases in the setting of mass surveillance, where 
patients would self-collect the specimen at ease. 
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1. INTRODUCTION 


The coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2) has caused a huge disease burden globally [1]-[3]. As new variants of 
SARS-CoV-2 arise, increased numbers of infection and deaths at unprecedented rate are being recorded 
worldwide [4]-[7]. Reinfection cases were not uncommon [8]-[11], even after COVID-19 vaccination 
program are being rolled out in many countries [12], [13]. Active case finding through contact tracing and 
performing testing are mandatory to prevent mass transmission since many active cases are asymptomatic 
[14]-[16]. Several methods have been proposed to provide early detection of COVID-19, i.e., through 
detection of viral ribonukleat acid (RNA) [17], [18], viral antigen [19], [20], and serum antibody [21], [22]. 
Viral RNA detection using nucleic acid amplification test (NAAT) is known to have low limit of detection 
with excellence analytical specificity and among many different types of NAATs, reverse transcriptase 
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polymerase chain reaction (RT-PCR) assay has been considered as the golden standard test for laboratory 
diagnosis of SARS-CoV-2 infection [23]. 

Throat swab specimens (TSS), which includes nasopharyngeal swabs (NPS), oropharyngeal swabs 
(OPS), dan naso-oropharyngeal swabs (NOS) are widely used and recommended as a standard specimen for 
the respiratory virus diagnosis, including SARS-CoV-2 [24]. However, performing throat swabs requires 
specific skills. In the setting of mass surveillance and the need for serial examinations, it would be 
nonbeneficial for the healthcare workers who are involved. Another issue of specimen collection device 
shortage due to high load of swab examinations calls for new means of COVID-19 specimen collection 
which requires simpler procedure and less expensive specimen collection devices. 

Using salivary specimens (SS) for SARS-CoV-2 detection provides new interest as saliva has 
recently been reported to show positive detection means for SARS-CoV-2 [25], [26]. Some advantages of 
using SS for SARS-CoV-2 diagnosis are multiple specimens self-collection and reduced need for healthcare 
professional during specimen handling, which reduced nosocomial transmission risk [24]. Saliva collection 
method also reduces discomfort in serial examination caused by repetitive swabbing procedure as well as 
provides less requirements for transport medium due to specimen stability [27], [28]. However, aside from 
the advantages shown by this means of specimen collection, support evidence regarding its diagnostic value 
is still limited and varied. Different literatures showing distinct SS collection methods provided different 
results in comparison to TSS as the standard reference. We conducted a systematic review of literatures 
reporting the use of saliva as a specimen for RT-PCR assay in detecting SARS-CoV-2 in comparison to TSS, 
while taking into consideration the various methods used for SS collection. 


2. RESEARCH METHOD 

The preferred reporting items for systematic reviews and meta-analyses (PRISMA) Guideline for 
reporting systematic reviews were used as general guide to conduct this systematic review [29]. The research 
question for this study is “How is the diagnostic value of salivary specimens for COVID-19 detection using 
RT-PCR assay compared with throat swab specimens?” 

Six databases were used for identifying relevant studies, which consisted of Scopus, ProQuest, 
ScienceDirect, CINAHL, EBSCO, and PubMed. English articles were included and reviewed by three 
independent researchers. The terms used as keywords were saliva, salivary, COVID, SARS-CoV-2, 
detection, diagnosis, diagnostic, and RT-PCR. Inclusion criteria included studies between year 2020 and 
2021 which evaluated the diagnostic value of SS examined using RT-PCR assay in comparison to any form 
of TSS (NPS, OPS, or NOS) for SARS-CoV-2 detection. Both published and unpublished studies (if any) 
were included. Study designs were all observational (cross-sectional, prospective, and retrospective study, if 
any). Exclusion criteria included studies which did not use RT-PCR assay for reference testing; studies which 
used specimens other than TSS as comparators; and study protocols and conference abstract, of which full 
text articles were not available online. Outcome measure was the diagnostic value of SS RT-PCR assay in 
comparison to TSS, included but not limited to diagnostic agreement, sensitivity, specificity, and viral load 
comparison denoted in form of cycle threshold (Ct) value. 

All studies were critically appraised using methodological quality assessment tools specific to each 
study design. The AXIS critical appraisal tool for cross-sectional studies [30] was used for appraising 
observational cross-sectional studies. The Newcastle-Ottawa quality assessment form for cohort studies was 
used for appraising cohort studies. Data and information collected from the reviewed studies were aggregated 
and reported in narrative manner. 


3. RESULTS AND DISCUSSION 

We found 228 non-duplicated records from six different databases. Preliminary screening excluded 
195 records, yielded a total of 33 records to be assessed for their eligibility through full text. A total of 
14 records were further excluded due to unsuitable article types (review articles, letters to editors, and 
research proposal) and different study objectives, resulted in 19 records which were included within the final 
review as shown in Figure 1. 


3.1. Participants and study characteristics 

The 19 articles included in this review consisted of 13 cross-sectional studies [31]-[43] and six 
cohort studies [44]-[49] where specimens were collected and/or tested repetitively following certain period 
of time to obtain comparative results on different symptom onset duration. A total number of 8,227 
participants were included within these studies. Most of the participants were adults with age ranging from 
18-82 years old, except for four studies [35], [39], [40], [44] which also included pediatric patients (age four 
years and above). 
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All studies were comparing the diagnostic value of SS SARS-CoV-2 RT-PCR results with any 
forms of TSS (NPS, OPS, and/or NOS). Several studies were also comparing SARS-CoV-2 RT-PCR results 
from other respiratory tract specimens such as tracheal aspirate and sputum [45], cerumen and tears [34], and 
nasal swabs [47], [48] in additions to SS. 


Records identified through database searching: 


Records identified through other sources 
CINAHL 7, EBSCO 158, PubMed 140, Science Di- 


{hand searching) 


rect 30, Scopus 90, ProQuest 83 
(n= 0) 


(n= 508) 


Records identified 
(n= 508) 
Records after duplication removed 
(n= 228) 
Records screened Records excluded 
(n= 228) {n= 195) 


Identitication 


Full text excluded 
{n= 14) 


Full text assessed for eligibility 
Unsuitable article type (8) 
Different study objectives (6) 
Studies included in synthesis 


Figure 1. PRISMA flowchart of screened and included studies 
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Reasons for exclusion: 


3.2. Quality assessment 

The 13 cross-sectional studies were appraised using the AXIS critical appraisal tool for 
cross-sectional studies, yielding fairly good quality. Four studies were considered to have less appropriate 
study design to meet the study objective by not providing control group to their study population [34], [35], 
[42], [43], eleven studies were unable to provide complete information regarding non-responder or missing 
data [31]-[36], [38]-[41], [43], and five studies were unable to provide information regarding ethical 
approval and/or may have possible conflict of interest [31], [32], [37], [38], [41]. 

The six remaining cohort studies were assessed using Newcastle-Ottawa quality assessment form for 
cohort studies. All six studies showed poor quality due to inability to provide information regarding 
comparability. No information was provided regarding control of confounding factors. Furthermore, four 
[44], [46], [47], [49] out of six studies included only confirmed case of COVID-19, provided no negative 
cases as non-exposed group. Among these six studies, only one study [47] denoted independent blind 
assessment of the outcome, while the other five provided no information at all. 


3.3. Diagnostic agreement comparison between SS and TSS 

All 19 studies included within this review showed similar characteristics in comparing performance 
of SS to TSS in detecting COVID-19 using SARS-CoV-2 RT-PCR assay. Each study may be unique in 
presenting the outcome, yet still comparable to each other. We grouped studies with similar outcome 
indicator to be reviewed together based on the diagnostic agreement, sensitivity and specificity, also viral 
load comparison. 

We found 13 studies measured percent agreement between SS SARS-CoV-2 RT-PCR assay and 
TSS. Variations occurred between SS collection methods, which provided different types of SS collected and 
viral or universal transport media used, with additional information on patients’ clinical condition 
(symptomatic or asymptomatic). This information is summarized in Table 1. 
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Table 1. Level of agreement shown in individual studies 


Author Type of SS Transport Clinical Percent Cohen’s kappa Level of 
collected media condition agreement coefficient (x) for OA agreement 
[31 Induced by No Symptomatic PPA 100%. NA Strong 
coughing NPA 99.4% 
[32 Pure, by spitting UTM Symptomatic PPA 84.8%. 0.851 (95% CI, 0.745- Strong 
NPA 97.8%. 0.958) 
OA 94.4% 
[33 Induced by VTM Symptomatic PPA 83.43%. 0.814 (95% CI, 0.775- Strong 
coughing OA 91.25% 0.854) 
[34 No detailed VTM Both PPA 76.3% NA Moderate 
information 
[35 Pure, by spitting No Both PPA 90.56% NA Strong 
[36 Pure, by drooling No Symptomatic PPA 81.1%. NA Strong 
NPA 99.8% 
[37 No detailed No Both OA 98.4% 0.91 (95% CI, 0.82- Strong 
information 0.96) 
[39 Pure, using nylon VTM Both PPA 26.3%. 0.252 (95% CI, 0.09- Weak 
swab OA 55% 0.42) 
[40 Pure, by chewing No Symptomatic OA 83.6% 0.58 (95% CI) Weak 
cotton pad 
[41 No detailed No Symptomatic PPA 98%. 0.96 (95% CI, 0.92- Strong 
information OA 98.3% 0.99) 
[44 Pure, by spitting No Symptomatic OA 86.7% to 0.625-0.883 (95% CT) Moderate 
96.4% 
[45 Pure, by spitting No Symptomatic PPA 60% NA Weak 
[48 Enhanced, by No Both OA 76.2% 0.537 (95% CI) (CDC- Weak 
clearing throat (CDC-LDT). LDT). 
and nose 80.1% 0.602 (95% CI) 
(Fortitude 2.1) (Fortitude 2.1) 


NA=not available, UTM=universal transport media, WTM=viral transport media, PPA=positive percent agreement, NPA=negative 
percent agreement, OA=overall agreement. 


Of all 13 studies showing percent agreement results, eight studies provided information regarding 
overall agreement (OA) with Cohen’s kappa Coefficient (x). Among these studies, only four studies [32], 
[33], [37], [41] showed strong to almost perfect agreement (0.80<K<1). Study conducted by Uwamino et al. 
[44] showed moderate agreement (0.60<K<0.79) for specimens collected long after symptom onset, but 
showed strong agreement for specimens collected within ten days of symptom onset. The other three studies 
[39], [40], [48] showed weak agreement (0.40<K<0.59). Five remaining studies provided information 
regarding positive percentage agreement (PPA) with/without negative percentage agreement (NPA). Strong 
agreement was shown by three [31], [35], [36] out of the five studies, assuming less than 20% disagreement 
(PPA>80%) was considered sufficient [50]. Of all 13 studies included, only five studies included 
asymptomatic cases, of which two showed weak agreement [39], [48] and one showed moderate 
agreement [34]. 

We put into account the difference between studies regarding the means of SS collection method, 
the use of transport media, and patients’ clinical condition. The idea of proposing SS as diagnostic specimen 
for COVID-19 detection laid on its simple specimen collection method, specimen stability without transport 
media, and aim for mass surveillance especially for asymptomatic cases. Induced saliva provided more 
complex procedure and might mask the study results as lower respiratory tract fluid might contaminate the 
SS. The use of transport media (UTM or VTM) omitted the initial purpose of using SS to resolve issue of 
transport media shortage. Study conducted by Lépez-Martinez et al. [27] indicated the viral genome is stable 
and endures perturbations within saliva as biofluid. Included only symptomatic cases as study population 
might also mask the ability of SS to detect asymptomatic cases in the setting of mass or serial surveillance as 
in healthcare workers. 

Considering only the pure (not induced or enhanced) saliva without the use of transport media to 
achieve the aim of this SS collection method, there are only four studies to compare. Three studies showed 
moderate [44] to strong [35], [36] agreement with one study showed weak [45] agreement. Note that from 
these four studies, only one study [35] included asymptomatic cases and still showed strong agreement. 
Compared to induced or enhanced saliva and/or the use of transport media, the number of studies showing 
moderate to strong agreement was higher than the number of studies showed weak agreement. This proposed 
that simpler method of SS collection can still be carried out with considerable similar result to TSS, although 
this was limited to symptomatic condition only. 
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3.4. Sensitivity and specificity comparison of SS and TSS 

Ten studies provided information regarding sensitivity (with/without specificity) of SS 
SARS-CoV-2 RT-PCR assay in comparison to TSS. Table 2 summarized variation of results between studies. 
Of all eleven studies, four studies showed higher sensitivity of SS compared with TSS [37], [38], [40], [42], 
one of which showed significant difference [42] and the other showed concordant results in terms of 
sensitivity. Four studies showed lower sensitivity of SS [32], [34], [47], [49], all of which showed concordant 
results in terms of sensitivity, meaning no significant difference. Study conducted by Dogan et al. [43] 
performed serial specimen collection within four days (day 1 and day 5), resulted in significant difference in 
sensitivity on day 1, but with nonsignificant difference on day 5. Study conducted by Bidkar et al. [39] 
denoted significantly lower positivity rate of SS compared to the conventional NOS standards. This was 


consistent with x value of 0.252 (95% CI, 0.09-0.42) which showed weak agreement. 


Table 2. Sensitivity (SN) and specificity (SP) shown in individual studies 


Author SN & SP of SS SN & SP of throat swab specimens Additional information 
[32 SN 85.7% (95% CI, 70.6%- SN NPS 94.3% (95% CI, 81.4%-99.0%) «=0.851 (95% CI, 0.745- 
93.7%) 0.958) 
[33 SN 83.43% (95% CI, 79.07- NA «=0.814 (95% CI, 0.775- 
87.20) 0.854) 
SP 96.71% (95% CI, 94.85-98.04 
%) 
[34 SN 76.3% (95% CI) NA PPA 76.3% 
[37 SN 93% (95% CI, 0.81-0.99) SN NPS 91% (95% CI, 0.79-0.98) «=0.91 (95% CI, 0.82-0.96) 
[38 SN 53.7% (95% CI) SN NPS 47.4% (95% CI) p=0.13 
[39 SN 24.4% (95% CI). NA «=0.252 (95% CI, 0.09-0.42). 
SP 94.9% (95% CT) p=0.002 
[40 SN 78.6% (95% CI, 67.6%- SN NOS 74.2% (95% CI, 63.7%-83.1%) «=0.58 (95% CI) 
86.6%) 
[42] SN 88.09% (95% CI) SN 45.24% (95% CI) p<0.001 
[43 SN 63% (95% CI). day 1 SN NPS and NOS both 83% (95% CI). day 1 p<0.001. day 1. 
SN 55% (95% Cl). day 5 SN NPS 55% (95% CI). SN NOS 60% (95% CI). p=0.386. day 5 
day 5 
[47 SN 37.5% (95% CI, 24.2%- SN NPS 100% (95% CI, 91.2%-100.0%) «=0.225 (95% CI, 0.067- 
53.0%) SP 52.9% (95% CI, 31.0%-73.8%) 0.383) 
SP 94.1% (95% CI, 73.0%- 
99.0%) 
[49 SN 64% (95% CI) SN NPS/OPS 77% (95% CI) p=0.135 


NA=not available 


In terms of sensitivity, although some studies showed variability, only few proposed significant 
differences or diagnostic disagreement. Note that most of the studies used « value to show agreement in 
comparing sensitivity and not p value to show significance. Diagnostic agreement does not always mean the 
SS are comparable to TSS in detecting COVID-19. x value is affected by the probability for the condition to 
be present [51]. For most of these studies, the probability was quite high since most of the studies included 
COVID-19 confirmed or symptomatic suspected cases which later turned out to be confirmed case. However, 
these studies managed to indicate that SS was comparable to TSS in terms of sensitivity, especially in 
symptomatic or confirmed cases. 


3.5. Viral load comparison between SS and TSS 

Thirteen studies provided information regarding viral load (denoted as Ct value) of each specimen in 
comparison, i.e., SS and TSS. Table 3 summarized results variation between studies. Of all 13 studies, 
Ct values of SS were consistently higher than Ct value of TSS. Not all studies provided p value for 
comparisons, but the seven studies showed significant differences between these specimens’ Ct value, 
meaning the detected viral load within SS was lower than TSS. Ct value of RT-PCR represents the number of 
amplification cycles required for the target gene to exceed a threshold level, meaning it inversely related to 
viral load [52]. Specimen quality and specimen handling affected the viral load detected within specimen. 
Note that SS are intended to be self-collected to provide ease of specimen collection. Standard procedure for 
SS collection should be made clear to ensure there are enough viral components within the specimen to be 
detected. On the other hand, further research setting should accommodate less manipulated SS (i.e., by 
induced cough, or chewing, or swabbing mucosal surface) to provide the closest expected diagnostic value of 
the simpler method of SS collection. 
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Table 3. Viral load (Ct value) shown in individual studies 


Form of Ct 


Author/Year/Country value Ct value of SS Ct value of TSS Difference and/or p-value 
[31] Mean (SD) 24.16 (4.80) 20.55 (5.36) -3.61 (95% CL, -5.78 to -1.44; 
p=0.002) 

[32] Median (IQR) 24.27 (23.46 to 24.96) 22.63 (21.88 to 23.85) p=0.0331 
[33] Median NA NA p<0.05 

difference 
[34] Mean (SD) 30.97 (1.56) 27.98 (4.29) p<0.001 
[35] Mean (SD) 30.64 (2.83) 27.80 (3.44) p=0.016 
[36] Median (IQR) 31 (29-37) 26 (21-34) p<0.001 
[37] Median Higher than the NPS NA 2.76 (95% CI, 0.36-5.15; 

difference p=0.03) 
[39] Median 27.6 for E gene. 22.5 for E gene. 

27.1 for ORFlab 27.2 for ORFlab 
[41] Median (IQR) 26.10 (22.75-30.06) 18.88 (15.60-23.58) p<0.0001 
[42] Mean difference Higher than the throat NA p<0.05 
swabs 
[45] Median (IQR) 28.00 (20.08-31.00) 25.50 (17.37-36.74) p<0.05 
[47] Median 35.34 (95% CI, 29.89- 33.65 (95% CI, 23.16- NA 
38.01) 38.54) 

[49] Median (IQR) 32 (23-38) 33 (27-35) p<0.753 


NA=not available 


3.6. Strengths and limitations 

This study pointed out the differences between patients’ initial condition (symptomatic or 
asymptomatic), specimen collection method, and transport media used. To our knowledge, this was the first 
study considering all these parameters as confounders of the studies’ aim, i.e., to provide alternative 
specimens to TSS which enabled ease specimen collection method, required less transport media, and 
enabled early detection of asymptomatic cases. Limitations of this study included limited number of studies 
which included asymptomatic cases and authors’ limitation to further perform meta-analysis regarding the 
diagnostic accuracy. 


4. CONCLUSION 

The use of salivary specimens (SS) as an alternative to throat swab specimens (TSS) is promising 
although improvement in specimen collection method should be put into account. SS is comparable to TSS in 
detecting COVID-19 using RT-PCR assay. However, evidence of its comparability to TSS in asymptomatic 
cases is very much lacking since most of the studies included only symptomatic, suspected, or confirmed 
cases only. More RCTs are still needed to clearly demonstrate the ability of SS to capture asymptomatic 
cases in the setting of mass surveillance, where patients would self-collect the specimen at ease. 
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